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Sensor placement by maximal projection on 
minimum eigenspace for linear inverse problems 
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Abstract —This paper presents two new greedy sensor place¬ 
ment algorithms, named minimum nonzero eigenvalue pur¬ 
suit (MNEP) and maximal projection on minimum eigenspace 
(MPME), for linear inverse problems, with greater emphasis on 
the MPME algorithm for performance comparison with existing 
approaches. In hoth MNEP and MPME, we select the sensing 
locations one-by-one. In this way, the least number of required 
sensor nodes can be determined by checking whether the estima¬ 
tion accuracy is satisfied after each sensing location is determined. 
For the MPME algorithm, the minimum eigenspace is defined 
as the eigenspace associated with the minimum eigenvalue of 
the dual observation matrix. For each sensing location, the 
projection of its observation vector onto the minimum eigenspace 
is shown to be monotonlcally decreasing w.r.t. the worst case 
error variance (WCEV) of the estimated parameters. We select 
the sensing location whose observation vector has the maximum 
projection onto the minimum eigenspace of the current dual 
observation matrix. The proposed MPME is shown to be one 
of the most computationally efficient algorithms. Our Monte- 
Carlo simulations showed that MPME outperforms the convex 
relaxation method flj, the SparSenSe method and the 
FrameSense method in terms of WCEV and the mean square 
error (MSE) of the estimated parameters, especially when the 
number of available sensor nodes is very limited. 

Index Terms —Linear inverse problem, sensor placement, 
greedy algorithm, rank-one modification, local optimization. 


I. Introduction 

S ensor networks are widely used for monitoring 
temporal-spatial physical fields. While each sensor node 
can only observe the field intensity (e.g., temperature, hu¬ 
midity, concentration of contaminant, etc.) of a particular 
location, with a network of sparse sensor observations, a 
physical field of interest may be reconstructed by solving a 
linear inverse problem 0-®. In physical field estimation, the 
number of sensor nodes and their spatial locations are closely 
related to the coverage, cost, battery energy consumption, and 
even the error of the estimated physical field. Therefore, the 
determination of the least number of required sensor nodes 
and their locations is critical in sensor network design. 
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For a linear inverse problem, sensor placement is to seek the 
least number of required sensor nodes and their corresponding 
sensing locations within a known spatial domain such that the 
estimation accuracy can meet the requirement. Specifically, 
assuming that the observation models of all potential sensing 
locations are known, we want to determine the least number 
of required sensors with which the physical field of interest 
can be recovered within a predefined accuracy. Obviously, one 
straightforward method is to evaluate the performance of all 
possible combinations of all potential sizes of the candidate 
sensing locations, and then select the one with the least number 
of sensor nodes that satisfies the required estimation accu¬ 
racy. But such a combinatorial approach is computationally 
intractable. In practice, direct enumeration is impossible if the 
number of potential sensing locations is large. Apart from the 
enumeration method, the optimal solution can also be obtained 
by branch-and-bound methods ®, pO) , which unfortunately 
do take a very long time, even for a moderate scale problem 
®. Consequently, in recent years the sensor placement for 
the linear inverse problem has attracted increasing attention to 
find a suboptimal solution via computationally more efficient 
methods Q-®, im-iii). 

A. Related prior work 

Heuristics have been proposed to reduce the cost of exhaus¬ 
tive search. The simplest one is to place sensor nodes at the 
spatial maxima and minima of proper orthogonal components 
of the physical field of interest ®. This method is simple 
but only suitable for some special cases 0 - Other heuristics 
include genetic algorithms particle swarm optimizer 
tabu search 0, and cross-entropy optimization IB- They all 
involve a prohibitive computational cost and the solutions have 
no optimality guarantee. 

Joshi and Boyd ® formulated the sensor placement prob¬ 
lem as an elegant nonconvex optimization problem, and ap¬ 
proximated it as a convex optimization problem by the relax¬ 
ation of the nonconvex Boolean constraints that represent the 
sensor placements, to a convex box set. This convex relaxation 
was then used in many works 0, in), 0-0. The sensing 
locations can be easily determined based on the solution of the 
convex optimization problem. But the sensor placement may 
lead to an ill-conditioned observation model due to the gap 
between the nonconvex and the convex optimization problems, 
especially when the number of sensor nodes is very limited. 
Such a result has been shown to be no better than other works 
0, 0, 0. However, the authors in ® provided a local 
optimization technique to improve the result. This technique 
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is computationally expensive but some numerical examples 
showed that with the local optimization, the convex relaxation 
method can indeed provide good results. 

The sensor placement problem was also solved by some 
greedy algorithms in which the sensor locations are individu¬ 
ally determined by optimizing some proxies of the error of the 
estimated physical field, such as the determinant of Fisher in¬ 
formation matrix fl^ , and the condition number ®-i) or the 
frame potential |[^ of the observation matrix. The rj-confidence 
ellipsoid of the estimation error depends on the determinant of 
the Fisher information matrix |[T], which was optimized using 
one greedy method in |12|, but it is shown to be no better 
than other methods in the examples in 0. For the sensor 
placement problem, the minimum requirement of the solution 
is that the observation model should be well-conditioned. 
Therefore, some researchers determined the sensing locations 
by minimizing the condition number of the observation matrix 
0-®- However, the condition number is a concept for 
nonsingular matrix, and we need to firstly determine a group 
of sensing locations to guarantee that the observation matrix 
is nonsingular 0^ which is unfortunately a combinatorial 
problem. Additionally, the minimum condition number of the 
observation matrix does not mean the minimum estimation 
error except when all the observation vectors have the same 
norm because the sensing energy should be considered, which 
is related to the signal-to-noise ratio. Recently, Ranieri et al. 

provided a novel greedy algorithm by minimizing the 
frame potential of the observation matrix. This method is 
computationally efficient but; 1) like the condition number 
minimization, it is only effective for the case where all the 
observation vectors have the same norm; 2) it cannot guarantee 
that the observation matrix is well-conditioned. 

All the above mentioned works focused on the case where 
the number of sensor nodes is fixed. One sparse-promoting 
technique has been used to minimize the number of required 
sensor nodes by adding a sparsity-promoting penalty term 
to the cost function ©■ This method works well when the 
dimension of the estimated parameter is small (e.g., the 
dimension is set as 2 in the example of Ref. [2]). However, 
if the dimension of the estimated parameter is large (e.g., a 
few tens, which is very common in fluid held reconstruction 
problems Q, |j7|, ||20|), this method will be ineffective in 
determining the least number of required sensor nodes, which 
will be discussed in detail later. 

Besides the sensor placement for linear inverse problems, 
many other excellent sensor placement works have focused 
on the continuous system fTT) , nonlinear model | [T^ , energy 
saving |^, jTS) , state estimation for dynamic system |18 
113’ Fli~P3|, and Gaussian process interpolation p^-| 27 


vector onto the eigenspace of the minimum eigenvalue of 
the current dual observation matrix. It is shown that such a 
projection is monotonically decreasing w.r.t. the worst case 
error variance (WCEV). 

Compared with the state-of-the-art, the proposed greedy 
algorithm which we call the maximal projection on minimum 
eigenspace (MPME), has the following advantages; 

• The MPME can readily determine the minimum number 
of required sensor nodes. 

• The MPME outperforms the convex relaxation method 
0, the SparSenSe method Q, and the ErameSense 
method 0 in terms of the WCEV and the mean square 
error (MSE) of the estimated vector, especially when the 
number of available sensor nodes is very limited. 

• The MPME can guarantee that the observation ma¬ 
trix is well-conditioned but the convex relaxation, the 
SparSenSe and the ErameSense methods cannot guar¬ 
antee such a condition, especially when the number of 
available sensor nodes is very limited. 

• Eor general sensor placement problems, the MPME with¬ 
out local optimization 0 outperforms the state-of-the-art 
with local optimization. 

• The proposed MPME is computationally one of the most 
efficient sensor placement algorithms. 


C. Outline and notations 

The rest of this paper is organized as follows. In Section 
II, we introduce the linear inverse problem and the sensor 
placement problem, and briefly review three current methods. 
In Section III, we develop the MPME algorithm. We then 
provide four examples to compare the effectiveness of MPME 
with the current methods via Monte-Carlo simulations in 
Section IV. In Section V, we analyze the computational cost of 
the MPME algorithm and compare it with those of the current 
methods. The conclusions are given in Section VI. 

This paper uses the following notations; Upper (lower) 
bold letters, e.g. A (a) or €> (t^), indicate matrices (column 
vectors). I represents an identify matrix with proper dimension 
whose *-th column vector is denoted by e^. 1 is a vector of 
proper dimension with all entries one. (-j^, E(-), tr(-), 
11 • |j, det(-), span(-), null(-), dim(-), and rank(-) are re¬ 
spectively the transposition, pseudo-inverse, expectation, trace, 
norm, determinant, spanned space, null space, dimension, and 
rank operators. 


H. Problem Statement 
A. Linear inverse problem 

We consider a physical held f G described as 


B. Our contributions 


f = $q: 


( 1 ) 


In this paper, we propose a new greedy algorithm to 
minimize the number of required sensor nodes and determine 
their locations for the linear inverse problem such that the 
estimation error meets the requirement. We determine the 
sensing locations one-by-one until the estimation accuracy is 
satisfied by maximizing the projection of each observation 


where a G K" is a vector of parameters to be estimated with 
n N, and $ G is a known full column-rank matrix, 

which we call the signal representation matrix and its column 
vectors compose a basis of the physical held. 

It is expensive and impractical to sense the physical field f 
with N sensor nodes since N is very large and depends on the 
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resolution of the discrete physical space 0- However, part of 
the physical field can be observed from sensor networks, i.e. 

y = Hf + u — + u (2) 

where H S ^mxn ^vj^ose z-th row is ej., Si G N' = 
corresponds to the z-th sensing location, and 
M{n < M N) is the number of sensor nodes. The 
observation matrix 

$ = H$ = 

is a pruned matrix from the rows of $ indexed by 
{si, S 2 , sm}, and (pj is the s^-th row of $ and represents 
the observation model of the z-th sensor node, which we call 
the observation vector. The measurement noise i/ G is 
assumed to be zero-mean i.i.d. Gaussian random process with 
variance 

From 0, we can obtain the following minimum variance 
unbiased estimate (MVUE) 


B. Sensor placement problem 

We denote the set of selected sensing locations by 5 = 
{si, S 2 , Sm}, and the set of potential sensing locations by 
Af = {1, 2,TV}, which correspond to the row indices of $ 
and #, respectively. Then, we formulate the following sensor 
placement problem. 

Problem 1: Given the signal representation matrix $ = 
G select M rows of $ indexed 

by si,S 2 ,...,sm to construct the observation matrix $ = 
G such that the error of the 

estimated parameters a in 0 is small enough and the number 
of rows of $, i.e. M, is minimized. 

This actually is a sensing location selection problem. We 
aim to find the minimum number of sensing locations with 
which the error of a is less than a predefined threshold. In this 
paper, we use the WCEV as the error indicator, and equation 
0 shows that a small WCEV can guarantee a small MSE. 
Then, this sensor placement problem can be formulated as the 
following cardinality minimization problem 


a = (3) 

where is the pseudo-inverse of 4>. The 

mean square error (MSE) of this MVUE 0-0 is 

n 

MSE(a) = E (||a - a\\l) = cr2tr('5'-i) = ^ — (4) 

fc=i 

where Ai > A 2 > ... > A„ stand for the eigenvalues of 


which we call the dual observation matrix. 

With some standard operations, we can obtain the variance 
of q; as 


S = E [(a — Q;)(a — a)"’’] 

= E 

Then, we introduce the following worst case error variance 
(WCEV) of the MVUE a 

2 

WCEV(q:) = max x'^'Sx = Amax(5]) = (5) 

l|x||2 = l A„ 

Eor more detail about WCEV, do refer to JH. Since 
= \ it is easily found from 0 and 0 that 

MSE(a) = a2||$t||| 

WCEV(a) = o-2|!$^|!2 

Consequently, the two error indicators are equivalent due to 
the equivalence of the two matrix norms | |28) . Specifically, 

WCEV(a) < MSE(a) < nWCEV(a) (6) 

It is clear in 0 and 0 that both MSE and WCEV depend 
on the eigenvalues of the dual observation matrix \I/, which 
fully depends on the sensing locations. 


S = arg min |5| subject to Xn > 1 (7) 

SCAT 

where | • | returns the cardinality of a set, and 7 corresponds 
to the maximum acceptable WCEV. 


C. The state of the art 

The combinatorial optimization problem 0 is NP-hard 
|29|. Here, we briefly review three current and related meth¬ 
ods. Two of them are originally designed for the case where 
the number of available sensors is fixed. However, they can 
be simply modified and applied to the case where the number 
of sensors is unknown, which is discussed in Remark 1. 

1) Convex relaxation 0-- If the number of sensor nodes is 
fixed, the sensor placement problem can be formulated as 


maximize 
subject to 


N 


log det 


l^w = M 

Wi G {0,1}, i G N 


( 8 ) 


with variable w G Here Wi = 1 means i G S and Wi = 0 
means i S. Performing a convex relaxation, i.e. replacing 
the nonconvex Boolean constraints Wi G {0,1} by Wi G [0,1], 
we can obtain the following convex optimization problem: 

/ N 

maximize log det j 

\i=l 

subject to = M 

zu, G[0,1], iGM (9) 


with variable w. This problem can be solved by the interior- 
point methods lig. Rearranging the entries of the solution 
of the relaxed problem 0, i.e. w*(G [0,1]^), in descending 
order yields the sequence Then, the set 

of the sensing indices is given by 5 = {si, § 2 ,. ■ ■, Sm}, be. 
the indices of the M largest elements of w*. 
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2) SparSenSe To determine the number of required 
sensor nodes, the following convex optimization, called sparse- 
aware sensor selection (SparSenSe), is formulated: 


minimize 
subject to 


l!w||zi 

T 


^ 0, j = 


||x||ii < 7 ', Xj > 0 , j = 1 , ...,n 


Wi G [0,1], i&M 


1, ...,n 

( 10 ) 


where 7 ' corresponds to the maximum acceptable MSE index. 
This is a linear matrix inequalities problem and can be solved 
by using the CVX toolbox With the solution w* and a 
prior threshold t (0 ^ r <C 1 ), we can determine the sensing 
indices. If w* < r, set w* = 0. The number of required sensor 
nodes is the number of nonzero entries of w*. 

3) FrameSense The ensemble of the rows of a matrix 
can be viewed as a frame. If all the observation vectors have 
the same norm, according to the frame theory, the observation 
matrix €> achieves the minimum MSE when it achieves the 
minimum frame potential | p^ . Eor the basic concept of the 

frame theory, do refer to |^. The sensor placement problem 
can be solved by minimizing the following frame potential 


( 11 ) 

ijeS 

One greedy “worst-out” algorithm, called the ErameSense, 
can provide a near-optimal solution in the sense of the mini¬ 
mum frame potential. At each step, it removes the row of $ 
that maximally increases the frame potential. If $ corresponds 
to an equal-norm frame, the row index is in fact the index of 
the row/column of which has the largest 1 -norm. 

Here, denotes a matrix whose entries are the square of the 
corresponding entries of A. 

Remark 1: With simple modifications, convex relaxation 
and ErameSense can be used to determine the least number 
of required sensors. Eor the convex relaxation method, it can 
be found by increasing the sensor number from n until the 
constraint in (0 is satisfied. Eor ErameSense, when removing 
each row of we check the constraint in (0. If the constraint 
is not satisfied, reserve the row and the number of remaining 
rows of $ is the least number of required sensors. 


III. Maximal Projection on Minimum Eigenspace 

As mentioned before, one apparent method to solve the 
cardinality optimization problem 0 is to evaluate the min¬ 
imum eigenvalue of the dual observation matrix (i.e. A„) of 
all potential sensor configurations, and then find the configu¬ 
ration with the minimum number of sensors that satisfies the 
constraint. Unfortunately, the computational cost of exhaus¬ 
tively searching 2^ potential configurations is unaffordable 
for large scale problems. One simple strategy to reduce the 
number of searched sensor configurations is to determine 
the sensing locations one-by-one. With such a strategy, the 
minimum number of required sensor nodes, M, can be easily 
found by judging whether the constraint in 0 is satisfied 
after each sensing location is determined. In this way, the 


number of searched sensor configurations can be reduced to 
—i), since at the first step over N possible sensing 
locations are searched, then N — and so on. 

Admittedly, each sensor node may have correlated influence 
with others, and one sensor reading is informative for a given 
sensor configuration but may be meaningless for others. When 
finding the sensing locations one-by-one, we do not know 
the contribution of each sensor node for the final sensor 
configuration; therefore, such a strategy cannot guarantee the 
optimal solution. However, we are trying to make a tradeoff 
between the computational cost and the number of required 
sensor nodes, i.e. to And an effective sensor configuration with 
proper number of sensor nodes by determining the sensing 
locations one-by-one. 

When determining the sensing locations one-by-one, we can 
obtain an observation vector sequence Eor simplic¬ 
ity, we introduce a new matrix G to denote the first 

k{l < k < M) rows of $, i.e. 

corresponds to the first k sensing locations. Corresponding to 
we can obtain the matrix sequence and 

the dual observation matrix sequence Here = 

has a nonincreasing eigenvalue sequence 
The notations that will commonly appear in this paper are 
listed below for easy reference. 


the k-th selected obseiwation vector corresponding to the fc-th 
sensing location, is the k-ih row of ^ and s^-th row of 
includes the first k observation vectors. It consists of the first k 
rows of the observation matrix ^ = ^m- 

is the dual observation matrix associated with 
(k) 

\) ^ the 2 -th eigenvalue of i.e. Aj(^fc). 

(k) (k) 

^ the normalized eigenvector of associated with . 

(fe—1) 

(In the multiplicity of the minimum eigenvalue A^ ^ w.r.t. 

An the minimum eigenvalue of 


(k) 

For all k < n, the minimum eigenvalue of ’4'^, AA = 0 
because rank(fl/j;) = rank(4>fc) < k. Therefore, the number 
of required sensor nodes M must be no less than n. Since $ 
is a full column-rank matrix, the nonsingular exists, which 
implies that the optimal M may be equal to n. Consequently, 
to minimize M we should guarantee that is nonsingular. In 
that case, the vectors in are mutually independent 

and rank(\I/fc) = rank(4>fc) = rank({<^g.}^^]^) = k for 
all fc < n. In practice, we can easily guarantee that is 
independent with the vectors in when determining 

the /c-th sensing location for all k < n. 

Our purpose is to And the shortest observation vector 
sequence by determining the sensing locations one- 

by-one, such that the constraint in 0 , i.e. An > 1 , is 
satisfied. Eor a given sensor configuration corresponding to 
< k < M), we need to formulate some guidelines 
to determine the next sensing location Sk- Intuitively, we can 
traverse all A—A:-|-l unselected observation vectors to find the 
one that can maximally increase A„. Then, the other sensing 
locations can be similarly determined one-by-one until A„ > 7 
and meanwhile the number of selected sensing locations is the 
minimum number of required sensor nodes, i.e. M. 

However, the challenge is that we do not know how 
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affects A„(= An^^) since M and {v’sili'ife+i are unknown 
when we find the fc-th sensing location. In other words, 
we cannot build an explicit mapping between and A„. 
Therefore, it is impossible to find the optimal /c-th sensing 
location s\ by directly optimizing A„. 

Our main idea is to find a new criterion instead of A„. 
When determining the k-th sensing location, we optimize the 
new criterion. The criterion should satisfy three conditions: 

1) It can be directly obtained from and 

2) Optimizing the criterion can guarantee that all the vec¬ 
tors in are mutually independent. 

3) An is positively correlated with the criterion. 

The first condition implies that we can directly assess the 
contributions of all potential observation vectors for the new 
criterion. The second condition guarantees that is nonsin¬ 
gular, and the last condition guarantees that increasing the new 
criterion can increase A„, i.e. decrease the WCEV. If such a 
criterion exists, we can determine the sensing locations one- 
by-one by maximizing the new criterion. Accordingly, we can 
find a suboptimal sensor configuration. In what follows, we 
shall present two alternative criteria. 


is positively correlated with \^\ Since A„ = A^^^ > 
A^m- 1 ) > _ _ > > \n \ A„ is positively correlated 

with A^*^ for any n < k < M. 

For any k < n, ( |T3] l shows that a[.^^ is the upper bound of 
Therefore, if we select sensing location to maximize 
a[,^\ we maximize the upper bound of Ai"^ Actually, A^^^^ 
is monotonically increasing w.r.t. A^ for any k < n, which 
will be shown later. Hence, An"^ is monotonically increasing 
w.r.t. A^^^ for all k < n. Since A„ = aI^^ is positively 

correlated with we conclude that A„ is also positively 

ik') 

correlated with A^ for any k < n. 

In summary, A„ is positively correlated with the minimum 
nonzero eigenvalue of i.e. a[,^^ for fc < n and for 
k < n, which satisfies the third condition. 

Therefore, the minimum nonzero eigenvalue of ’4'^ can be 
a criteria in place of An to optimize the /c-th sensing location. 
To maximize A„, we can select to maximize a[,^^ when 
k < n and maximize A^^^ when k > n. The greedy algorithm 
for the sensor placement problem named minimum nonzero 
eigenvalue pursuit (MNEP) is summarized in Algorithm [T] 


A. Minimum nonzero eigenvalue pursuit (MNEP) 

It can be easily found that 

This equation implies that all the eigenvalues of satisfy 
the first condition of the criterion used to replace A„, i.e. the 
eigenvalues of can be found if ^k-i and p^^ are known. 
Amongst all the eigenvalues of ^k, we guess that the minimum 
nonzero eigenvalue, i.e. A^ when k < n and AA when k > 

n, is one choice of the criterion to be used to replace A„. 

ik) 

Obviously, for any k < n, maximizing A^ can guarantee 

(k) 

that A^ > 0, which implies that for any k <n, the vectors in 
{Ps }i=i are mutually independent. Therefore, the minimum 
nonzero eigenvalue of ’4'^. satisfies the second condition. We 
then utilize the following theorem to show that it also satisfies 
the third condition. 

Theorem 1: Suppose B = A -f cc^ where A G is 

symmetric, and c G M" is a non-zero vector. Then, 


Ai(B) > Ai(A) > A2(B) > A2(A) > ... > A„(B) > A„(A) 


Proof: See p4) and the Theorem 8.1.8 in p8[ . ■ 

Since '4'^ = + p^^p^^, considering Theorem^ we 

can obtain 


Al'^^ > for all k>n 

A^^^ < A^^^^ for all k <n 


From the two equations, we can easily find that 

A^"^ > > ... > a1") (12) 

Ai”) < < ... < A<^^ (13) 

For any n < k < M, (fT^ shows that xit^ is the lower 
bound of AA . If we maximize AA by proper selection 
of , we maximize the lower bound of Hence, 


Algorithm 1: minimum nonzero eigenvalue pursuit 

Input: $ = [p^,P 2 ,..., 

Output: $ G 5, M 

1 Initialization: Af = {1,2,..., N}, 5 = 0. 

2 Determine the first n — 1 sensing locations: 

(a) 4>o = [ ], fc = 1. 

(b) Sfc = arg max Xk{^J_^^k-i + 

ieN\S 

(c) Update: 5 = 5 U {sk}, ^k = [^Li 

(d) Set k = k + 1 and repeat step (b-c) until k = n. 

3 Determine the remaining sensing locations: 

(a) Sk = arg max Xn{^l_i^k-i + PiPj)- 

ieN\S 

(b) Update: 5 = 5 U {4}, ^k = [*Li 

(c) If aI*^ > 7 return S, M = k, and $ = else set 
k = k \ and repeat step (a-b). 


We determine the sensing locations one-by-one. For the 
first n — 1 sensors, the fc-th sensing location can be obtained 
from the optimization problem in step 2(b). To solve this op¬ 
timization problem, we traverse all the unselected observation 
vectors and find the one that maximizes A^ . After n — 1 
sensing locations have been determined, we find the remaining 
sensing locations by solving the optimization problem in step 
3(a), which is similar to the previous one but maximizes AA , 
i.e. the minimum eigenvalue of 4/^. Meanwhile, we check the 
constraint in 0 after each sensing location is determined. If 
the constraint is satisfied, stop the algorithm. 

When determining the fc-th sensing location, we need to 
traverse N — k + 1 rows of $, and evaluate the minimum 
nonzero eigenvalue of ’4'f; for each case. Solving eigenvalue 
problems for all A^—fc-l-1 cases is computationally expensive. 
Can we find a simpler alternative criterion for the observation 
vectors in {Pi\f^i to avoid solving eigenvalue problems for 
all possible dual observation matrices? 
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In what follows, we provide another criterion, i.e. the mag¬ 
nitude of the projection of onto the minimum eigenspace 
of and it is shown to satisfy the three aforementioned 

conditions. Compared with the MNER, the greedy algorithm 
via optimizing the new criterion is computationally more 
efficient and effective. The dehnition of minimum eigenspace 
will be given later. 

B. Maximal projection on minimum eigenspace (MPME) 

Sensor placement is about selecting proper observation 
vectors to guarantee that the eigenvalues of meet cer¬ 
tain requirements. To understand how the observation vectors 
{v’s affect the eigenvalues of we introduce the 
following theorem. 

Theorem 2: For any matrix the symmetric 

matrix has a nonincreasing eigenvalue sequence 

and 


optimal solutions, which are exactly the normalized eigenvec¬ 
tors associated with the minimum eigenvalue of i.e. 

,(fe-i) _ ,(fc-i) _ _ 

For any k < n, the projection of any vector in {<Ps.}iTi 
onto the null space of i.e. null(€>fc_i), is zero. It is 

clear that the square summation of the projections of {<Ps .}iTi 
onto any other subspace except the subspace of null($fe_i) is 
nonzero. Therefore, null($fe_i) is exactly the subspace with 
the highest dimension onto which the square summation of 
the projections of all vectors in {<Ps }i=i minimum. 

For simplicity, we introduce a new concept, the minimum 
eigenspace, as follows. 

Definition 1: For any positive semi-dehne symmetric ma¬ 
trix A e with the nonincreasing eigenvalue sequence 

{Ai(A)}"^j^, the minimum eigenspace of A is the eigenspace 
associated with all the minimum eigenvalues of A, i.e. 

Ufc:n(A) = span(ufc, u^+i,..., u„) 


i=i 

(k) (k) 

where ^ is the normalized eigenvector associated with . 
Proof: See Appendix A. ■ 

In ( [T^ , represents the magnitude of the projection 

of ipg. onto the eigenspace span(u(^(), which is associated 

^ (k) 

with the eigenvalue A^ . Theorem 2 shows that the eigenvalue 
of ’S'fc equals the square summation of the projections of all 
columns of onto its eigenspace. 

(k) 

For any k > n, the minimum eigenvalue of ’4'^, i.e. AA , 
equals the square summation of the projections of {Ps }^=i 
onto the eigenspace associated with the minimum eigenvalue. 
However, before the fc-th sensing location is determined, 
the eigenspace of is unknown. Therefore, to assess the 
contribution of the all unselected observation vectors for An ^ 
we need to solve the eigenvalue problems for all potential 
cases, like the MNEP, which is computationally expensive. 

To analyze the relation between the observation vectors in 
{v’s }l=i ^nd the eigenspace associated with the minimum 
eigenvalue of we present the following theorem. 

Theorem 3: The normalized eigenvector associated with 
Ai'=( of ^k 

= arg min||$fcx ||2 (15) 

l|x||2 = l 

(k) 

and one sufficient and necessary condition of AA ' > 7 is that 
for any nonzero normalized vector x £ K", 


where is the eigenvector associated with Ai(A), and 
Afe_i(A) > Afc(A) = A„(A). 

Equation m implies that to meet the requirement on 
An(= aI^^), i.e. An > 7, the square summation of the 
projections of {Psk}^=i any non-trivial subspace of K" 
should be larger than 7. Therefore, it is reasonable that when 
determining the fc-th sensing location we select the observation 
vector 93 * that has the largest projection onto the non-trivial 

k 

subspace onto which the square summation of the projections 
of {(/Sg is minimum. 

Therefore, if fc < n we can select the fc-th sensing location 
whose observation vector has the largest projection onto the 
null space of It is easily found that the null space 

of is exactly the minimum eigenspace of i.e. 

null($fc_i) = Uk:n{^k-l)- If k > n, the square summa¬ 
tion of the projections of {<Ps }i=i oi^^o the /j,„ subspaces 
span(u(f_r^^^_,_;^), span(u(fj'^^^_,_ 2 ),..., and span(ul^“^') are 
equal and minimum. Generally, A^“^ is a simple eigenvalue 
and = 1. If > 1 we can select the fc-th observation 
vector that has the largest projection onto the spanned subspace 

..., which is exactly the 

minimum eigenspace of 

In summary, for all fc > 1, we can determine the fc-th 
sensing location by maximizing the projection of the obser¬ 
vation vector on the minimum eigenspace of the current dual 
observation matrix 

For simplicity, we denote 

z = (17) 


||4*fcx|||>7 (16) 

Proof: See Appendix B. ■ 

For any k > n, equation ( fTS] ) shows that span(ul^~^() is 
the subspace onto which the square summation of the pro¬ 
jections of {v’s minimum. However, if the minimum 

eigenvalue of i.e. AA 7 is a multiple eigenvalue with 

multiplicity /i„, considering equations ( 1 ^ and ( |T5] l we can 
hnd that the optimization problem in has /i„ different 


where U 2 ^~^\ ..., and let Zi be the 

i-th component of z. It is clear that the square of the projection 
of onto the minimum eigenspace of is 


Ck 


E 




= < 


2 = n —/iri + 1 



^ 2 =n—/iri + 1 


if k < n 

if k > n 
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Let Pfe_i be a projection matrix which can project any 
vectors in M" onto the minimum eigenspace of ^k-i- When 
k < n, the minimum eigenspace of is the null space of 

Then, we can hnd that 


Pfc—1 — Inxn iP'/c—1 

where Rfe_i = orth($^_j^) whose column vectors are 
obtained from the Gram-Schmidt Orthonormalization of all 
the column vectors of i.e. the vector group {^Ps }i=i- 

When fc > n, it is clear that 

where Then we 

can obtain that 

Ck = \\Pk-iP>J\l 


Apparently (^k can be obtained if €>fc-i and are known, 
i.e. (k satishes the hrst condition of the new criterion used to 
replace A„. If 7^ 0 for any k < n, the projection of ip^^ onto 
null($fc_i) is nonzero; therefore, is independent with all 
the vectors in {Ps }i=i^ which implies that (^k satishes the 
second condition. Next, we utilize the following theorem to 
show that (k satishes the third condition. 

Theorem 4: Given any observation matrix ^k = [v’si; 

..., S and its corresponding dual observation ma¬ 
trix Vhfc = with a nonincreasing eigenvalue sequence 

If fc < n and ^k is full row-rank, then 


\(^) _ 

'^k — 


Cfe 


k-1 

1+ E ■ 




-A 


(fc) 


Xfl = A«,... = A« = 0 


(18a) 

(18b) 


(k) (k—1) 

and is monotonically increasing w.r.t. Xj._i 
If fc > n, then 


A 


(fc) 

n—+ 1 


= A 


(fc-i) 
n—/X ti + 1 


Cfe 


1 + 


n — ^n 

E 


2 = 


y (~ 1)_ 


-Mn + 1 


(19a) 


\(^) _ \(^) 

^n-/in+2 ~ ^n-Ai^+3--- 




(19b) 


and for any M > n, A^^^ is monotonically increasing w.r.t. 
(^k for all fc < M. 

Proof: See Appendix C. ■ 

This theorem shows that A„ is monotonically increasing 
w.r.t. C,k- Accordingly, WCEV is monotonically decreasing 
w.r.t. (jfc. It is clear that C,k can be another choice of the criterion 
instead of A„ to optimize the fc-th sensing location. Therefore, 
to hnd the fc-th sensing location, we maximize the magnitude 
of the projection of onto the minimum eigenspace of 
vEfe-i instead of maximizing the minimum nonzero eigenvalue 
of The greedy sensor placement algorithm, which we call 
maximal projection on minimum eigenspace (MPME), is given 
in Algorithm]^ 

The fc-th sensing location is obtained from the optimization 
problem in step 2(b) or step 3(a). To solve the optimization 
problem, we traverse all the unselected observation vectors 


Algorithm 2: maximal projection on minimum eigenspace 

Input: $ = e 

Output: $ e S , M 

1 Initialization: M = {1,2,..., TV}, 5 = 0. 

2 Determine the first n — 1 sensing locations: 

(a) Set $0 = [ ], Po = Inxn and fc = 1. 

(b) Sk = arg max \\Pk-iPi\\l. 

ieN'\S 

(c) Update: 5 = 5 U {sk}, ^k = [^Ei 

Rfe — C)rtll(^^ ), Pk — Inxn RfeR/j; . 

(d) Set fc = fc -I- 1 and repeat step (b-c) until k = n. 

3 Determine the remaining sensing locations: 

(a) Sk = arg max \\Pk-iPi\\l. 

ieAr\s 

(b) Update: 5 = 5 U {s*}, ^k = [^Ei 

^l^k = UAfcU^, Pfc = U„UE 

(c) If aI*^ > 7 return S, M = k, and $ = ^k, else set 
fc = fc -f 1 and repeat step (a-b). 


and hnd the one that maximizes (jk- Meanwhile, if fc > n, 
we check the constraint in 0 after each sensing location is 
determined. If the constraint is satished, stop the algorithm. 


C. Discussions about MNEP & MPME 


In both algorithms we need to solve the optimization 
problem in step 2(b) or step 3(a). In Algorithm to solve 
the optimization problem, we need to evaluate the minimum 
nonzero eigenvalues of all — fc -f 1 possible dual observation 
matrices, while in Algorithm we compute the projection of 
N — K + 1 possible observation vectors onto the minimum 
eigenspace of ^k-i- The computational cost to hnd the 
eigenvalue of an ti x n matrix is much more expensive than 
projecting one vector in M" onto a known subspace. Therefore, 
the MPME is computationally much more efficient. 

Equations ( |18a[ ) shows that aE is monotonic increasing 
w.r.t. (jk and decreasing w.r.t. zf for all z < fc < n. Generally 
An^ is a simple eigenvalue when k > n, i.e. = 1, and 
(|19a|l-([T9b|l can be simplihed as 


A« = Al'=-i) + 


Cfc 


n—1 

1 + E ■ 

2 = 1 , 2 : 4^0 


( 20 ) 


xfc-i; 


-A 


(fc) 


Equation ( |20l l shows that is monotonic increasing w.r.t. 
(k and decreasing w.r.t. zf for all z < n < fc. Therefore, the 
MNEP algorithm prefers to select the observation vector with 
large (k and small zf, which may not be with the largest (k 
but achieves a balance between a large (k and a small z?. It is 
clear that X;r=i E = llv’sfc II2 and XlEi II‘Es, lli^= Er=i EE 
therefore, small zf means small increment of aE from a[^ 
for all z < fc when k < n, and all z < tz when k > n. 
Additionally, ( |18a| i and ( [20l i show that both a[,^^ and aE^ are 
monotonically increasing w.r.t. Therefore, selecting the 

observation vector with small zf leads to a relatively smaller 
aE^ and An^^\ where fc_|_ > fc, and the MPME probably 














100x20 Gaussian random matrices 


100x20 Gaussian random matrices 






Fig. 1: The performance comparison between the MPME and other sensor placement methods for Example^&nd Example^ For the MPME 
and the MNEP we do not consider the stopping criteria but show the mean WCEV index and the mean MSE index w.r.t. the number of 
sensor nodes, increasing from 20 to 40. For the SparSenSe method, we do not consider the minimum number of required sensor nodes but 
determine the sensing locations like the convex relaxation method by finding the indices of the M largest elements of w*. 


(k) 

outperforms the MNEP in terms of finding the largest An for 

k > n. 

In MPME we maximize Cfe, which can guarantee a large 
minimum nonzero eigenvalue of the updated dual observation 
matrix. In MNEP we directly maximize the minimum nonzero 
eigenvalue of the current dual observation matrix. Therefore, 
both algorithms guarantee that An is not much smaller than 

(k) \ 

XI ' for i < n and k ^n, which implies that both algorithms 
can guarantee that \I/^. is well-conditioned when k^n. Here, 
k^n means k is slightly larger than n. 

In both algorithms, the minimum number of required sensor 

(k) 

nodes is determined by judging whether AA > 7 is satished 
after each sensing location is determined when fc > n. It 
is clear that in both algorithms the constraint in Q is only 
considered in step 3(c), i.e. the last sub-step. In other words, 
the constraint in Q is only used to judge whether the number 
of sensor nodes is enough. 

Remark 2: We can change the constraint in Q to other 
constraints described by MSE or det(S), if other measures 
are more interesting. Correspondingly, in step 3(c) we check 
the new constraint. 

IV. EFFECTIVENESS OF THE MPME ALGORITHM 

In this section, we provide four examples to illustrate the 
effectiveness of the MPME. The detailed analysis of the 
comparisons with the state-of-the-art, i.e. the convex relaxation 
Q, SparSenSe Q, and ErameSense Q will also be presented. 


Example 1: $ € ]^ioox20 ^ Gaussian random matrix 
with independent components ipij ~ ^^(0, 1), and the vari¬ 
ance of the sensor noise = 1. 

Example 2 : $ G Kioox20 ^ Bernoulli random matrix 
with independent components ipy ^ ^(1,0.5) with ^ rep¬ 
resenting the Binomial distribution, and the variance of the 
sensor noise = 1. 

The mean WCEV index and the mean MSE index of 200 
Monte-Carlo simulation run results for the two examples are 
given in Eig. The optimization problems 0 and 
are solved using the SDPT3 solver p5) and CVX toolbox 
0, respectively. Actually, the SDPT3 solver is used as the 
computational engine of the CVX toolbox. 

Eor the i-th (1 < j < 200) simulation run, we determine the 
observation matrix (20 < k < 40) based on the random 
signal representation matrix and obtain the following 

MSE index and WCEV index from Q and (|^, respectively; 

MSE^*^ (21) 

WCEVi*^ = A„,ax(('5'i*V) (22) 

where Then, the mean MSE index and 

the mean WCEV index of 200 Monte-Carlo simulation run 
results are given by 

_ 1 ^200 /,N 

MSEfc = - V MSE^ 

200 ^i=i 

_ 1 ^200 /,N 

WCEVfc =-V WCEV[*^ 

200 ^i=i 
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It is shown in Fig. [T] that for both examples, the MPME 
outperforms all the other methods in terms of the mean WCEV 
index and the mean MSE index. If the same number of sensor 
nodes are used, the MPME can provide the best results of 
linear inverse problems as compared with the other methods. 

If we set the WCEV index threshold to be 0.3 (i.e. 
7 = 10/3) for Example the top left figure shows that 
the minimum number of required sensor nodes Mmnep = 
A^MPME = 23, Mconvex. .relaxation — .^SparSenSe — 28, and 
^FrameSense = 37. If we Set the MSE index threshold to 
be 1.5, the top right figure shows that the minimum number 
of required sensor nodes Mmpme = 23, Mmnep = 25, 

Mconvex.relaxation — M^gparSenSe — 26, and MpranieSense — 

36. Therefore, to meet the accuracy requirement, the proposed 
MPME algorithm requires the least number of sensor nodes. 
Eor the Bernoulli random data matrix we can easily obtain the 
same conclusion from the bottom two figures. 

Fig. 0 shows that MPME outperforms MNEP, which has 
been analyzed in Section III-C Additionally, we find that for 
all the five algorithms, the improvement of WCEV and MSE 
are increasingly insignificant with an increase in the number 
of sensor nodes. It means that the influence of additional 
sensor observation declines and its location is not so critical 
as the previously determined sensing locations. It is clearly 
shown in Eig. that MPME is much better than the convex 
relaxation method, SparSenSe, and ErameSense when the 
number of sensor nodes is very limited, i.e. the MPME method 
significantly outperforms the state-of-the-art in finding the 
critical sensing locations, which is analyzed as follows. 


index of the convex relaxation method are much larger than 
those of MPME, respectively. 


100x20 Gaussian random matrices 



Fig. 2: The mean condition number of the dual observation matrix 
for Example^ Here is obtained from the five 
sensor placement algorithms. 

In practice, the solution of the convex optimization problem 
w* € [0,1]^, is mapped into {0,1}'^ to find the sensing 
locations. The largest M elements of w* are mapped to 1 
and other elements are mapped to 0. In such a mapping, the 
singularity of is not considered. The number of sensor nodes 
is nearer the dimension of the estimated vector a, giving a 
higher probability of fP being ill-conditioned. However, the 
proposed MNEP and MPME algorithms can guarantee a large 
minimum nonzero eigenvalue of and accordingly a well- 
conditioned ’S'm for M > n. 


A. Comparison with convex relaxation 

Fig. 0 shows that the mean MSE index and the mean 
WCEV index of the convex relaxation method are much larger 
than those of the MPME method when the number of sensor 
nodes is slightly larger than the dimension of the vector to be 
estimated, i.e. when fc ^ n = 20 in this example. Comparing 
the left two figures with the right two figures in Eig. [T] we 
find that when using the convex relaxation method, the WCEV 
index (i.e. the maximum eigenvalue of see equation 
contributes the main part of the MSE index (i.e. the trace of 
see equation ([2T]i), especially when the number of sensor 
nodes is small. It means that the maximum eigenvalue of 
is overwhelmingly larger than the others when k ^n.ln other 
words, the minimum eigenvalue of is much smaller than 
the other eigenvalues, which implies that '4'fe is ill-conditioned, 
and hence the estimated vector a is inaccurate. 

The mean condition number of Monte-Carlo simulation 
result for Example [T]is shown in Eig.|^ Eor i-th (1 < i < 200) 
simulation run, the condition number of is denoted by 
The mean condition number is given by 


Kfc = 


—y 

200 


200 


Fig. 1^ shows that for k < 23, fPfc obtained from the convex 
relaxation method is ill-conditioned, whereas obtained 
from MNEP or MPME is well-conditioned. Accordingly, as 
shown in Eig. [T] the mean MSE index and the mean WCEV 


B. Comparison with SparSenSe 

It is claimed that SparSenSe can determine the minimum 
number of required sensor nodes by utilizing the sparsity of 
the variable w in ( [TOl l. With a predefined threshold r and the 
solution of optimization problem ( [T0| l, i.e. w*, if w* < r set 
w* = 0. The sensing indices then exactly correspond to the 
nonzero entries of w* and the number of nonzero entries is 
the minimum number of required sensor nodes. This strategy 
works well for the example in § in which $ G and the 

largest 3 entries of w* are much larger than other elements. 
We can set t as a small value and easily find the largest 3 
entries corresponding to the selected sensing indices. 

However, this strategy is ineffective if the dimension of the 
estimated vector a is large. In the pervious two examples, 
w G with at least 20 nonzero entries is not sparse. We 
introduce another example to illustrate this problem. 

Example 3: $ G ^ Gaussian random matrix 

with independent components (pij ~ M^(0,1). The variance 
of the sensor noise cr^ = 1, and the maximum acceptable MSE 
index 7' = 1.5. 

In Example W ;g> n guarantees that the decision variable 
w is sparse. Eig. shows that if r = 0.02, 98 sensor 
nodes are selected. If t is set as 0.05 and 0.1, then 71 and 
31 sensor nodes are selected, respectively. We can see the 
result of SparSenSe from Eig. that 31 sensor nodes can 
guarantee that the MSE index is less than the maximum 
acceptable MSE index. However, the minimum number of 
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1500x20 Gaussian random matrix 



Fig. 3: Selected sensing indices with SparSenSe for Example 3. 


1500x20 Gaussian random matrix 



Fig. 4: The MSE index of a estimated from 20 to 40 sensor 
observations for Example 3. 


required sensor nodes is 23. For this example, if we set 
T = T* G [0.1129,0.1140], the 23 largest elements of w* 
will be selected. In practice, however, the optimal threshold 
T* is a prior unknown; therefore, utilizing the sparsity of w to 
determine the minimum number of sensor nodes is ineffective. 

In practice, we can set M = n and select the sensing indices 
that correspond to the M largest entries of w*. Then, we check 
whether the accuracy (i.e. WCEV index or MSE index) is 
acceptable. If not, increase M, reselect the sensing indices and 
recheck the accuracy until the accuracy is acceptable. Using 
this strategy for Example [^ we can easily hnd the minimum 
number of required sensor nodes M = 23. 

Like the convex relaxation method. Fig. [T] and Fig. [^ shows 
that the solutions of MPME are much better than those of 
SparSenSe when k ^ n. The reason is that the sensing 
indices corresponding to the k largest elements of w* cannot 
guarantee a well-conditioned fFfc as shown in Fig. [^ 


C. Comparison with FrameSense 

It is apparent from Fig. and Fig. [^ that FrameSense 
provides the worst results for the first two examples in which 
$ is not an equal-norm frame, i.e. the norms of the rows of 
$ are not equal. For these cases, minimization of the frame 
potential in 0 will select the rows of $ with small norms 
to construct the observation matrix $. FrameSense prefers to 
drop the rows with large norms Q. It is easily found that 



tr('5') = ^. A, 


Compared this equation with the MSE in Q, we conclude 
that small norms of the rows of the observation matrix $ 
lead to a large MSE of the estimated vector a. Erom another 
perspective, we hnd that the smaller the norm of for 
i = the smaller is the signal-to-noise ratio of the 

measurement model. Therefore, FrameSense is only suitable 
for the case where $ corresponds to an equal-norm frame. 

Actually, even if $ corresponds to an equal-norm frame, 
the proposed MPME algorithm still outperforms FrameSense, 
which will be illustrated by the following example. 

Example 4: $ S ]^ioox20 ^ random matrix with normal¬ 
ized rows, whose i-th row ipj = and cf)^ G is a 

random vector with independent components ^ J^{0, 1). 
The variance of sensor noise = 1. 

The mean WCEV index and the mean MSE index of 200 
Monte-Carlo run results are shown in Eig. [^ The two hgures 
show that ErameSense outperforms the convex relaxation 
method and SparSenSe when the number of sensor nodes is 
small. The right hgure shows that the five methods except 
MNEP provide almost the same mean MSE indices when the 
number of sensor nodes is large enough, which indicates the 
effectiveness of ErameSense in pursuing the minimum MSE. 

However, like the convex relaxation method and SparSenSe, 
in Eig. [^ both WCEV and MSE of ErameSense are much 
larger than those of MPME when the number of sensor nodes 
is small (e.g., fc = 20 or 21), which indicates that ErameSense 
cannot guarantee a well-conditioned when k^n. 

Additionally, Eig. [^ shows that MPME requires the least 
number of sensor nodes to meet the accuracy requirement, 
and that the required sensor nodes of ErameSense is less than 
those of the convex relaxation method and SparSenSe. 

If $ corresponds to an equal-norm frame, the minimum 
frame potential of # implies the minimum MSE, but the 
“worst-out” strategy in ErameSense cannot find the optimal 
solution, and even cannot guarantee that fP is well-conditioned 
if the available sensor nodes is limited. The solution of Frame- 
Sense is near-optimal because of the sub-modularity of the 
cost function; however our MPME algorithm still outperforms 
ErameSense in the four examples. Hence, our future work may 
focus on exploring the reasons why MPME outperforms the 
near-optimal solution from a theoretical perspective. 


D. Local optimization 


The four examples show that the current methods are not 
suitable for the cases that the number of sensors is small. This 
drawback can be overcome by a computationally expensive 
technique, i.e. the so called local optimization technique Q. 

Definition 2 (Local optimization): Eor a given set of sens¬ 
ing locations S, exchange one-at-a-time all the sensing loca¬ 
tion in S with each available candidate location in J\f/S to 
re-locate the sensor nodes at new position that can further 
reduce one criterion of interest (e.g., MSE or WCEV) until 
there is no further decrease. 

This technique is similar with Eedorov’s exchange algorithm 
1361, and Wynn’s algorithm p7) . It has also been discussed 
in & 0 - For any results of local optimization, replacing 
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Fig. 5: The mean WCEV index and mean MSB index of a estimated from 20 to 40 sensor 


observations for Example 


local minimization of WCEV 100x20 Gaussian random matrices 



local minimization of WCEV 100x20 normalized Gaussian random matrices 



local minimization of MSE 100x20 Gaussian random matrices 




Fig. 6: The performance comparison between MPME and five methods with local optimization for £jfampfe|^and Example^ 


any selected sensing location by any unselected one cannot 
improve the solution, which is called 2-opt. 

We apply the local optimization technique to the solutions 
obtained from the hve methods for the four examples. Fig. 
shows the mean WCEV indices and the mean MSE indices 
of the improved sensor conhgurations, together with those 
directly obtained from MPME, i.e. without local optimization. 
In Eig. the solutions of the convex relaxation method, 
SparSenSe, and ErameSense are remarkably improved by the 
local optimization, especially when the number of sensor 
nodes is small. However, the solutions of MPME almost 
have no improvement. Nevertheless, the two top figures show 
that the solution of MPME without local optimization still 
outperforms all the other solutions with local optimization in 
terms of both indices. 


The bottom right figure shows that with local optimization, 
the solution of ErameSense for normalized Gaussian random 
matrices are sightly better than the solution of MPME without 
local optimization. However, the local optimization is compu¬ 
tationally very expensive, and the MPME provides the best 
result amongst the hve methods for all the other cases. How 
the local optimization affects a given sensor conhguration and 
which types of sensor conhguration can be greatly improved 
by local optimization are still open problems. 


Additionally, Eig. shows that for all cases, to meet the 
accuracy requirement, the solutions of MPME without local 
optimization require the least number of sensor nodes. Erom 
this perspective, MPME without local optimization outper¬ 
forms the state-of-the-art with local optimization. 
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TABLE I: The computational effort of the five sensor placement methods 


Convex Relaxation 

SparSenSe 

FrameSense 

MNEP 

MPME 

O(icA^) 


O(A^) 

O(NMn^) 

O(NMn^) 


V. COMPUTATIONAL COST OF THE MPME ALGORITHM 

In this section, we compare the computational cost of 
MPME with that of the state-of-the-art. 

The computational effort of the convex relaxation method 
is 0{icN^) 0- The convex optimization problem is solved 
using the interior-point method and is the iteration number 
Typically, the iteration number is of a few tens ||T]. 

Similar to the convex relaxation method, the computational 
effort of SparSenSe is 0{isN^) where ig is the iteration 
number of solving the convex optimization problem ( [TOl i. 

When using ErameSense, N—M rows are removed from 
It costs O [{N — A: -f 1)^) to determine the fc-th removed row. 
Since M N, the total cost of ErameSense is ~ 

Einding the eigenvalues of ’4'^ costs 0(n^) operations. 
When we find the fc-th sensing location via MNEP, the 
main computational cost is to solve the minimum nonzero 
eigenvalue maximization problem in which N — k + 1 eigen¬ 
value problems are solved. The computation cost is O (^Nn^). 
Therefore, to determine all the M sensing locations via MNEP, 
the total computational effort is 0{NMu?). 

To determine the fc-th sensing location via MPME, the main 
computational cost is attributed to the optimization problem 
Sk = arg max ||Pf;_i(pj||2 which costs O [{N — k + l)n^). 

iGAf\5 

Hence, finding all the M sensing locations costs 0{NMr?). 
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Fig. 7: The mean computation time of the five sensor placement 
algorithms for A x 20 Gaussian random matrices where N is from 
100 to 1000. In the simulations, the number of available sensor nodes, 
i.e. M, is set as 20 for the convex relaxation method, FrameSenSe, 
MNEP and MPME. The computation time is estimated by the 
MATLAB tic-toc commands. The simulation is run in a laptop with 
a 2.4GHz Intel 13-31 lOM processor. The mean computation time is 
the mean value of 50 different simulations, and for each simulation, 
^ with independent entries ifiij ~ N{0, 1). 

We summarize the computational efforts of the five methods 
in Table The mean computation time of 50 simulation run 


results are shown in Fig. [7] Both Table and Fig. [7] show that 
MPME is computationally the most efficient one amongst the 
five algorithms if N ^ M > n. 


VI. Conclusions 

Sensor placement for linear inverse problems is an inter¬ 
esting but challenging combinatorial problem. The optimal 
solution can be solved by the exhaustive search and branch- 
and-bound methods 0, ng, but the methods are both im¬ 
practical due to the extremely expensive computational cost, 
especially for some large scale problems. Therefore, in the 
last decade, many works have focused on finding an effective 
suboptimal solution via computationally efficient algorithms. 
To the best of our knowledge, the proposed MPME algorithm 
is computationally one of the most efficient sensor placement 
algorithms. 

Our proposed MNEP and MPME algorithms select the sens¬ 
ing locations one-by-one. In this way, the minimum number of 
the required sensor nodes can be readily determined. Different 
with many popular methods, MNEP and MPME can guarantee 
that the dual observation matrix is well-conditioned when the 
number of sensor nodes is small and even near the dimension 
of the estimated vector. 

The sufficient and necessary condition of meeting the re¬ 
quirement on WCEV is shown to be that the square sum¬ 
mation of the projections of all selected observation vectors 
onto any non-trivial subspace of K" is large enough. The 
proposed MPME algorithm determines each sensing location 
by maximizing the projection of its observation vector onto the 
subspace onto which the square summation of the projections 
of all selected observation vectors is minimum. 

We perform Monte-Carlo simulations to compare the MNEP 
and MPME algorithms with the convex relaxation method 0, 
SparSenSe 0, and FrameSense 0- Based on the simulation 
results, we conclude that amongst the five methods: 

• To meet the accuracy requirement, the solution of MPME 
requires the least number of sensor nodes; 

• The MPME algorithm provides the best solution in the 
sense of minimum WCEV or minimum MSE, especially 
when the number of sensor nodes used is small; 

• MNEP and MPME work well when the number of 
available sensor nodes is very limited, while the state- 
of-the-art cannot; 

• For the general cases, the solution of the MPME without 
local optimization is even better than those of the state- 
of-the-art with local optimization. 

To encourage future works, we provide all the Matlab code 
used in this paper, which can be found from IEEE Xplore or 
https://github.eom/CJiang01/SensorPlacement.git 








13 


Appendix A 
Proof of Theorem|2] 

Proof: The spectrum decomposition of is 

(23) 

where Un^^] is an orthonormal matrix, 

(k) 

and Afe is a diagonal matrix whose diagonal entry A -- = 

(k) 

A - . Then, we can obtain 

Afe = 

from which ( [T^ can be directly found. ■ 

Appendix B 
Proof of Theorem[3] 

To proof Theorem we need the following lemma. 
Lemma 5 (Courant-Fischer Minimax Theorem): If A € 
I^nxn symmetric, then for i = 1 : n, 

Ax 

A; (A) = max min —=— (24) 

dim(U)=i O^xSU X-'-X 

Proof: See the proof of Theorem 8.1.2 in p^ . ■ 

Next, we prove Theorem]^ 

Proof: Since x^^tP^x = x"'’$J$feX = ||#/jx|| 2 , from 
Lemma |5] we have 

" min ||$fcx||^ (25) 

l|x||2 = l l|x||2 = l 

Then, from ( [T4l l and ( |25l l, we can obtain 

Next, we show the sufficient and necessary condition of 

(k) 

An > 7 is that for any nonzero normalized vector x S K", 

ll^fexll^ > 7. 

Sufficiency: Considering (fT4ll, if ||$fcx ||2 > 7 for any 

ffe') 

nonzero normalized vector x G ffi”, we can obtain AA ' > 7. 

Necessity: Since A^^^ > 7, considering ( |25| ) we conclude 
that |i$fcx ||2 > 7 for any nonzero normalized vector x G K". 


Appendix C 
Proof of Theorem|4] 

Considering 0 and ( | 2 ^ , we can obtain another descrip¬ 
tion of \Pfc, i.e. 

=U('=-i)(Afe_i-fzz^(U('=-i))T (26) 

Let Afc = Afe_i +zz^. From ( |2^ and ( |26l l, we find that tPs;, 
Ak, and A^ are mutually similar, which implies that they share 
the same eigenvalues. 

To proof Theorem |4| we need the following four lemmas. 
Lemma 6: If Zi = 0, A^ is an eigenvalue of A^, and 
the corresponding eigenvector is 

Proof: As Zi = 0, we can obtain 

AfeOi = Afc-iOi + zz'^Oi = 

(k—1) 

It is clear that A^ is an eigenvalue of Ak and is the 
corresponding eigenvector. ■ 

Lemma 7: For all i G I = {i|l < i < n, 7 ^ 0}, denote 
by Vi the eigenvector of Ak associated with a|^\ For all j G 


J = {i|l < j < n,Zj = 0 }, removing the j-th row and j-th 
column of Ak-i and Ak yields Ak-i and Ak, respectively, 
and removing the j-th entry of and z yields and z, 
respectively. Then, 

AfeVi = Afc_iVi -f zz'^’-Vi = (27) 

Proof: For all j G J, removing the j-th row of the 
following equation directly yields ( |27] i. 

AfcVi = Afc_iVi -f z (z'^Vi) = ApVi 


Lemma 8: If Ap = Ai^^Vi is an eigenvalue of Afe_i 
with multiplicity ^i and 7 ^ 0 , then is an 

eigenvalue of A^ with multiplicity /ii — 1 . 

Proof: Since V 7^ obtain 

rank(Afe — Ap = dim span(Afc — Ai^ 

= dim span([Afe_i - z]) 

= n — /ii -I- 1 


where for any matrix A, dim span(A) represents the di¬ 
mension of a linear space spanned by all the column vectors of 
A. Therefore, is an eigenvalue of Ak with multiplicity 

/ii - 1 . ■ 

Lemma 9: If A„ is a multiple eigenvalue of Ak-i with 
multiplicity /i„ and V ^ ^ 

for all i satisfying Zi 7 ^ 0 . 

Proof: It follows from Theorem [T] that 


Hence, 


A(fe-i) ^ ^(/c) ^ all 1 < / < n 


Cfc-i) , >(/c) ,(fe-l) 

'^n-fin + l — ^n-/x^ + l — ^n-fLr, 


(28) 

(29) 


Then, considering Lemma we can obtain 


A, 


(fe) 


n — fj>n + t ^ ''^n — fln + 1 


, (fc- 1 ) _ \(fc-l) _ \(fe) _ \(/c) /OQ-, 

'^n-u^+1 — — '^n-k,„+2 ~ 


Denote the multiplicity of w.r.t. Af._i by /t, where 

/i > 1. Hence, 

.(k-i) 


, (fc — 1 ) ,(k—l) _ , I 

'^n — fln — ^ ^n — fLn—fL-\-l —^-\-2 


= ... = AiV^i (31) 


If V 7 ^ 0> considering Lemma^ is 

an eigenvalue of Ak with multiplicity /r — 1. Therefore, if 


\(fe-i) _ \(fc) 

'^n — fLn '^n — fLn-\-l 

dfc) 


the multiplicity of w.r.t. A^ is /t — 1 and 


A, 


= A^'^^ 

71 —/^ri + 1 — 


(k) 




(k) 


< A, 


(fc) 


n—/J.+3 ^ ^'n—/i^ —/i+2 


(32) 


( 33 ) 


From we can obtain 


which obviously contradict with ( [28] l. Hence, A „7 


^n—/i^ + l 


Consequently, if Ei=n%„-,x+i V 7 ^ 0 , considering 


and ([ 30 I 1 we can obtain 


A, 


(fc-i) 


ri-iiL + t ^ + l — V 7 ^ F 


(k) 


< A 


(fe-i) ^ ^(fc-i) 
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else (Er=:-„+i"' = 0) 


,(fc-l) 


< A 


(fc) 


< a: 


(fe-i) ^ ^(fc-i) 


n—/j,Ti + l — — 


j <n- fi„ 


which implies this lemma. ■ 

Next, leveraging the four Lemmas, we are ready to prove 
Theorem HI 

Proof: Let A„ be the minimum eigenvalue of 
(Af._i) with multiplicity (> 1). If fc < n, = 0 and 

/J-n = n — k + 1. From Lemma we can directly obtain ( |18b| ) 
and ( |19b| i. 

If Er=n^„+i — 0’ 1^ Is obvious that (k = 0. According 
to Lemma ^ we can easily obtain ( |18a| l and ( [T9al l. 

If X]"=n-/i +1 recording to Lemma we can 

obtain 


tA 

(Afc-1 - A„_ 




Lemma 


0' 


+ z(z'^ 


v„-^„+i) = 0 (34) 
I is nonsingular. 


Jean guarantee that Afc_i — A„ 
and therefore from ( [34| ), we find that 0. Then, 

left multiplying z"'"(Afc_i —to both sides of ( |34| ) 
yields 


z'^v„_^^+i (l + z^(Afc_i - ^z) = 0 

Hence, 

1 + z'^(Afc-i - A|fi^^+il)-iz = 0 

from which we can directly obtain P9a| l. If fc < n, = 0 

and Hn = n — k + 1, substituting them into ( |19a| l yields ( |18a| l. 

It is clear that equation ( |19a| l is a general description of 
(|18a|i. From (|19a|i, we can obtain 


/■ _ \(^) _ 


n-fin /\{k) _ \^2 

^ 2^ A (/C-l) _ A (fc) 


(35) 


Taking the derivative of w.r.t. A„_^ we can obtain 
that 


dCk 


n — ^ri 


dA, 


(fc) 
n— 


= 1+ E 




^n— 


2 = l,2i^0 


(A(fc-i) _ ;^(fc) 


rL—fLn-\-l / 


Since A^^ > a|j*L for alH < n — /r„, it is obvious that 

dCfe/dAl^^ > 1. Therefore, 0 < dA^j^ _|_;^/dCfc < 1, which 
(fc) ^ 

implies that is monotonically strictly increasing w.r.t. 

C,k, i-e. 

Cfc o (36) 


where ‘a b’ means that b is monotonically increasing w.r.t. 
a. 

Taking derivative of both sides of ( [T5] l w.r.t. 
some operations, we can obtain 


and with 


dA, 


n—/j.Ti + 1 


\(^) 2^ 

—/T n +1 ^ n—/T „ 


dA, 


(fc-i) 


n — fln 




where 


n — Uri 


= E 


n — / 




/xlfc-i) \(fc) 




Since A 


(fc) 

n—/Xn + l 


can obtain that dA, 

v(fe) 


> 0 and x[^ > 0 for all * < n — /i„, we 

> 0, which implies that 


(fc) 


_Li is monotonically increasing w.r.t. 


i.e. 


A(fc— 1) \(^) 

^n-fin ^n-/x^ + l 


(37) 


If k < n, we have = n — k 1 and therefore, A 

(fc— 1) 

monotonically increasing w.r.t. Xj,_^ for all k < n, i.e. 


(fc) 


IS 


A'-L-'l « A'« 


Taking the derivative of both sides of 
and with some operations, we can obtain that 


(38) 


wrt 

W.r.t. 


dA 


(fc) 

n—/Tn+l 


n — lln 

1+ E 




-Mn + i 


dA 


(fc-i) 
n—/iri+l 






(A 


(fc-1) 


-a: 


(fc) 


-Mn+i 


Since A^* for all j < n — /i„, we can obtain 


/dAi'l-^ 


> 0, which implies that A„_ 


that 

(fc—1) 

is monotonically strictly increasing w.r.t. therefore. 


(fc) 

n — fLn-\-l 


A(fc—1) 

^n-/irr + l ^n-/irr + l 

For n < k < M, generally /x„ = 1 and from ( [36l l and 
we can obtain that 


(39) 


Cfc o A^'=) o o ... o Xi^^ 

If Hn > 1, from ( |36] ) and ( [JT] ! we can obtain that 

Cfc ^ o A)f_l))^_,_2 o ... o 


Since A 


(fc+fci„-i) ■ 


is a simple eigenvalue, considering 


( |4T] ) we can find 

Cfc ^ o ... o a(^) 


(40) 

(41) 
and 

(42) 


From (|40ll-(|42ll, we can obtain that 

Ai”^ ^ A^^^) (43) 

For k < n, the multiplicity of A„ w.r.t. is n—k+1, 

i.e. /i„ = n— k + 1. Then, considering ( |36] l, ( |38] l and ( |4^ , we 
can obtain that 

Cfc o A^ o Ai'+y) o ... o Ai") o A(^^) (44) 

In summary, from ( |40l l, ( |42| ) and ( |44| l, we can conclude that 
for any M > n, A^^^^ is monotonically increasing w.r.t. Cfc for 
all k < M. m 
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